CDS

Accession Number TCMCG011C14077
gbkey CDS
Protein Id XP_021899029.1
Location complement(join(487852..487968,488055..488147,488308..488428,488926..489068,489377..489458,489760..489971,490352..490482,490570..490737,490855..490974,491259..491364,491445..491459))
Gene LOC110815513
GeneID 110815513
Organism Carica papaya

Protein

Length 435aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA264084
db_source XM_022043337.1
Definition putative protease Do-like 14 [Carica papaya]

EGGNOG-MAPPER Annotation

COG_category O
Description Trypsin-like serine protease
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01002        [VIEW IN KEGG]
ko03110        [VIEW IN KEGG]
KEGG_ko ko:K08669        [VIEW IN KEGG]
ko:K08784        [VIEW IN KEGG]
EC 3.4.21.108        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko04210        [VIEW IN KEGG]
ko04214        [VIEW IN KEGG]
ko04215        [VIEW IN KEGG]
ko05012        [VIEW IN KEGG]
map04210        [VIEW IN KEGG]
map04214        [VIEW IN KEGG]
map04215        [VIEW IN KEGG]
map05012        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGAGCTATTTTTTGAAAAGAGCGTTGTTATCTTCTCCAAGAAATTCTTCTCTCATCCCGGTCGTAGCTGTTGCTGTCGCGGGATCGGGTCTATTTTATGCTAAGAGCAACCCGGATTCCTTCACAAGGGTTTCTCTGTTGATCCCTGCGCCATTTCTTGAATCTCTGCAAGTACCATGGCGAGTCTCACAAGACTTGTTTCAGCCCCCTTCACTTCTGTCATCAGACCCAGCGCAATTAGGGAACCTTCCATTATTTTCTTCTAGAGTTAGTCCTACTCCACCCGTGGATGTCCTTGTTGCAGCTGGAGACAACACAAAATCTCATGATCGATATTTAGGCAGAGATTCCTTTGCTAATGCTGCTGCAAGGGTTGGGCCGGCTGTTGTCCATATATCAGTTGCTCAGGTTGGTCCTTATGGAATTAATGCCGGATCAAATATGGGCTCGGGAACGATTATTGATGCAGATGGTACTATATTGACCTGTGCTCACCTTGTAGTTGGTTCTCATGGCGTACGAGGATTGTCCAAGGGAAAGGTTGATGTTACTTTACAAGATGGTCGGACATTTGAGGCTAAAGTGTTGAATGCTGATTTACATTCTGATATTGCAATTATAAAGATCAATTCCAAAACTCCTCTTCCAACTGCAAAATGTGGTTCTTCAAGCAAGCTTCGTCCTGGTGATTGGGTTATAGCCTTGGGCTGTCCTCTTTCCCTTCAGAACACTGTAACAGCTGGTATTATAAGCTGTGTTGATCGAAAAAGCAGTGATTTGGGCCTTGCTGGAATGCATAGAGAGTACTTGCAAACAGACTGTGCAATCAATCCAGGAAATTCTGGTGGTCCCCTTGTGAATATTGATGGAGAAATTGTGGGTGTTAATATTATGAAAAGATTAGCTGCAGATGGATTAGGTTTTGCTGTACCAATCGATGCAGTTTCCAAAATCATGGAGCAGTTCAAGAGGAATGGAAGAGTTGTCCGGCCTTGGCTTGGATTGAAAATGGTAGATCTTGATGGCATGATGATTGCCCAGCTCAAAGAAAGAGATGCTTCATTCCCCAATATTGAGAAAGGTGTTCTTGTAGCTATGGTAACTCCAGGGTCCCCTGCTGATCGTGCTGGGTTCCGTCTGCGTGATGTCGTAATTGAATTTGACGGGAAGCCAGTTGAAAGCATCAAGGAGATCATCAAAATAATGGGTGACAGAACCGGGAAACCCATGAAGGTATTCGTGACAAGAGCTAACAATGATTCAGTAACTTTGACTGTAATTCCAGAGGAAGCCAATCCAGACATGTGA
Protein:  
MSYFLKRALLSSPRNSSLIPVVAVAVAGSGLFYAKSNPDSFTRVSLLIPAPFLESLQVPWRVSQDLFQPPSLLSSDPAQLGNLPLFSSRVSPTPPVDVLVAAGDNTKSHDRYLGRDSFANAAARVGPAVVHISVAQVGPYGINAGSNMGSGTIIDADGTILTCAHLVVGSHGVRGLSKGKVDVTLQDGRTFEAKVLNADLHSDIAIIKINSKTPLPTAKCGSSSKLRPGDWVIALGCPLSLQNTVTAGIISCVDRKSSDLGLAGMHREYLQTDCAINPGNSGGPLVNIDGEIVGVNIMKRLAADGLGFAVPIDAVSKIMEQFKRNGRVVRPWLGLKMVDLDGMMIAQLKERDASFPNIEKGVLVAMVTPGSPADRAGFRLRDVVIEFDGKPVESIKEIIKIMGDRTGKPMKVFVTRANNDSVTLTVIPEEANPDM